Stochastic Models for Fault Tolerance - Restart, Rejuvenation and Checkpointing
نویسنده
چکیده
It's coming again, the new collection that this site has. To complete your curiosity, we offer the favorite stochastic models for fault tolerance restart rejuvenation and checkpointing book as the choice today. This is a book that will show you even new to old thing. Forget it; it will be right for you. Well, when you are really dying of stochastic models for fault tolerance restart rejuvenation and checkpointing, just pick it. You know, this book is always making the fans to be dizzy if not to find.
منابع مشابه
Modeling software systems with rejuvenation, restoration and checkpointing through fluid stochastic Petri nets
In this paper, we present a Fluid Stochastic Petri Net (FSPN) based model which captures the behavior of aging software systems with checkpointing, rejuvenation and self-restoration, three well known techniques of software fault tolerance. The proposed FSPN based modeling framework is novel in many aspects. First, the FSPN formalism itself, as proposed in [24], is extended by adding ush-out arc...
متن کاملStability Assessment Metamorphic Approach (SAMA) for Effective Scheduling based on Fault Tolerance in Computational Grid
Grid Computing allows coordinated and controlled resource sharing and problem solving in multi-institutional, dynamic virtual organizations. Moreover, fault tolerance and task scheduling is an important issue for large scale computational grid because of its unreliable nature of grid resources. Commonly exploited techniques to realize fault tolerance is periodic Checkpointing that periodically ...
متن کاملImproved Checkpoint / Restart Using Solid State Disk Drives
Fault tolerance and reliability of distributed systems is often achieved through checkpoint / restart mechanisms. Checkpointing frequency and restart delay crucially depend on data throughput and access performance of the storage medium. In this paper we discuss the opportunity to achieve subsecond checkpointing frequencies and restart delays by substituting magnetic hard disk storage with soli...
متن کاملCRAFT: A library for easier application-level Checkpoint/Restart and Automatic Fault Tolerance
In order to efficiently use the future generations of supercomputers, fault tolerance and power consumption are two of the prime challenges anticipated by the High Performance Computing (HPC) community. Checkpoint/Restart (CR) has been and still is the most widely used technique to deal with hard failures. Application-level CR is the most effective CR technique in terms of overhead efficiency b...
متن کاملHigh-Level Fault Tolerance in Distributed Programs
We have been developing high-level checkpoint and restart methods for Dome (Distributed Object Migration Environment), a C++ library of data-parallel objects that are automatically distributed using PVM. There are several levels of programming abstraction at which fault tolerance mechanisms can be designed: high-level, where the checkpoint and restart are built into our C++ objects, but the pro...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010